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ABSTRACT 

Infrared images of the Q 0957+561 gravitational lens obtained with the Hubble 
Space Telescope show two large (~5") lensed images of the Zs = 1.41 quasar host 
galaxy. Parts of the host galaxy are doubly-imaged like the quasar, while other parts 
are quadruply- imaged. The distortions of the host galaxy offer the best probe yet 
of the global structure of the lensing potential, which is essential for determining the 
Hubble constant from the measured time delay. The distortions are inconsistent with 
the predictions of previously published lens models, which invalidates those models 
and their implications for Hq. New models show that the distortions finally break 
the long-standing degeneracy between the shape of the lens galaxy and the tidal shear 
contributed by the cluster containing the lens galaxy. The shape of the lens galaxy's 
mass distribution must be remarkably similar to the shape of its luminosity distribution, 
and most models that produce reasonable values for the Hubble constant roughly match 
the observed ellipticity gradient and isophote twist of the lens galaxy. Also, the cluster 
must be non-spherical and produce a relatively small tidal shear. Although there are 
still degeneracies in the lens models that lead to a 25% uncertainty in the derived value 
of the Hubble constant, there are also strong prospects for new observations to further 
improve the constraints and reduce the uncertainties. 



^Based on Observations made with the NASA/ESA Hubble Space Telescope, obtained at the Space Telescope 
Science Institute, which is operated by AURA, Inc., under NASA contract NAS 5-26555. 
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1. Introduction 

Gravitational lenses offer an attractive independent method of determining the Hubble constant 
(Ho) on cosmological scales without the systematic difficulties associated with the local distance lad- 
der (Refsdal 1964, 1966). Six of the more than 50 known gravitational lenses now have time delays 
that can be used to estimate Hq: B 0218+357 (Biggs et al. 1999); Q 0957+561 (Schild k Thomson 
1995; Kundic et al. 1997; Haarsma et al. 1999); PG 1115+080 (Schechter et al. 1997; Barkana 1997); 
B 1600+434 (Hjorth et al. 1999); B 1608+656 (Fassnacht et al. 1999); and PKS 1830-211 (Lovell 
et al. 1998); preliminary Hq estimates for five of these systems have been compiled by Koopmans 
Sz Fassnacht (1999). Once a time delay is accurately determined, the uncertainties in the derived 
value of the Hubble constant are due almost entirely to the systematic uncertainties in the model 
for the lensing potential, plus "cosmic variance" due to the effects of weak density inhomogeneities 
along the line of sight (Seljak 1994; Barkana 1996). 

Only two of the time delay lenses have been modeled in sufficient detail to fully understand the 
systematic uncertainties created by their geometries: Q 0957+561 (Falco, Gorenstein & Shapiro 
1985, 1991; Kochanek 1991; Bernstein, Tyson & Kochanek 1993; Grogin & Narayan 1996; Chartas 
et al. 1998; Barkana et al. 1999; Bernstein &: Fischer 1999; Chae 1999; Romanowsky &; Kochanek 
1999), and PG 1115+080 (Schechter et al. 1997; Courbin et al. 1997; Keeton & Kochanek 1997; 
Saha &; Williams 1997; Impey et al. 1998). In Q 0957+561, the lens consists of a brightest cluster 
galaxy and its parent cluster, and the value of the Hubble constant depends on the mass balance 
between the two components. Judgments about the ability of models to determine the correct mass 
balance vary significantly: optimistic estimates yield Hq = 6IIJ5 km s^"*^ Mpc^^at 95% confidence 
(Grogin &: Narayan 1996, using the stellar dynamical models of Romanowsky & Kochanek 1999); 
while more pessimistic estimates yield Hq = 77^24 (Kochanek 1991; Bernstein et al. 1993; Bernstein 
Sz Fischer 1999). In PG 1115+080, the lens is one of the brighter galaxies in a small group, but the 
four-image geometry easily determines the relative roles of the primary lens galaxy and the group. 
Instead, the value of the Hubble constant depends on the assumed radial mass distribution of the 
primary lens galaxy. Galaxy models with dark matter and mass distributions consistent with the 
best estimates for early-type galaxies lead to low values for the Hubble constant (Hq = 44 + 4 
km s~^ Mpc~^; Impey et al. 1998). 

Impey et al. (1998) discovered an Einstein ring image of the quasar host galaxy in the 
PG 1115+080 lens, and noted that the geometry of the ring could be used to break the degen- 
eracy in the lens models. Here we report the discovery of the host galaxy in the Q 0957+561 
lens and discuss the implications for lens models and the Hubble constant. In §2 we describe our 
observations. In §3 wc discuss the data and methods used to model the system, summarizing pre- 
vious work and introducing our new data and techniques. In §4 wc examine the arcs predicted by 
previously published lens models and demonstrate that they fail to match the observed arcs. In 
§5 we present new models that match all data, including the arcs, and discuss their implications 
for the properties of the lens galaxy and cluster and for Hq. In §6 we summarize our results and 
discuss prospects for further improving the constraints on this lens system and on Hq. 
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2. Observations 

We observed Q 0957+561 with the Hubble Space Telescope as part of the CfA-Arizona Space 
Telescope Lens Survey (CASTLES; Lehar et al. 1999; Falco et al. 1999). Using the NIC2 camera, 
we obtained a 2800-second F160W (H band) image of Q 0957+561 divided into four dithered 
exposures. A log of the observations is presented in Table 1. We reduced the images using nicred, 
a custom reduction package developed for CASTLES (Lehar et al. 1999). We also reanalyzed 
archival WFPC2 images, including a 32200-second F555W (V band) exposure and a 2620-second 
F814W (I band) exposure (Bernstein et al. 1997). In the optical images, Q 0957+561 fell in chip 
WF3 of the WFPC2 camera (with pixel size ~0'.'l, compared to ~0'.'076 for NIC2). Because the 
images A and B of Q 0957+561 were saturated in the WFPC2 images, we derived our astrometry 
of A, B and the lens galaxy Gl exclusively from our unsaturated NIC2 data. 

Figure la shows our combined H band image, which prominently shows the lens galaxy and 
the two quasar images. The quasar host galaxy is visible as a faint arc next to the quasar A image. 
Table 2 summarizes a photometric model consisting of point sources for the two quasar images 
and elliptical de Vaucouleurs models for the main lens galaxy (Gl) and a neighboring galaxy (G2, 
not seen in Figure 1)J^ The lens galaxy is known to have a small ellipticity gradient and isophote 
twist (Bernstein et al. 1997), but we used only a simple elliptical model. Figure lb shows the 
residuals after subtracting the PSF-convolved photometric model from the original image, and 
Figure Ic shows these residuals convolved with a ~0'.'076 (1 pixel) FWHM Gaussian to enhance 
the visibility of low surface brightness features. The residual image shows the host galaxy image 
near quasar A to be an arc distorted tangentially relative to the lens galaxy. It also reveals an 
extended asymmetric arc near the B quasar image. If the residuals near quasar B were created by 
the error of fitting a simple elliptical surface brightness model to a galaxy with a radially varying 
ellipticity and orientation, we would expect them to have reflection symmetry through the center 
of the galaxy. Because the arc near quasar B lacks such symmetry, we conclude that it is the lensed 
counterpart of the arc near quasar A, and that both are images of the quasar host galaxy. 

The residual H band image also shows two additional sources, one east of quasar A and another 
just west of quasar B (labeled G3 and G4 in Figure Ic). Their positions and photometric properties 
are given in Table 2. These objects correspond in position and V magnitude to two of the faint 
sources seen in the deep V band image and labeled Blobs 7 and 1 (respectively) by Bernstein et 
al. (1997). Their colors (especially V— I) are similar to those of Gl and G2 and suggest that they 
are probably faint cluster members. Whether or not they are associated with the cluster, it is clear 
that they are not multiply imaged and are not associated with the quasar source. The fact that G4 
is not part of the host galaxy arc will be important to remember when examining the arc structure 



^Young et al. (1981) tabulated objects in the field of Q 0957+561 and applied tlie labels G1-G5 to five of the 
bright galaxies. We labeled only objects close to the lens galaxy, so other than Gl our labels do not match those of 
Young et al. Our object G2 appears as object #97 in their Table 1. Our objects G3 and G4 do not appear in their 
hst. 
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predicted by lens models (see §3.3). 

The lensed host galaxy is not detected at significant levels in the shallow I band image, although 
a visual examination suggests that low-level residuals axe present and should be detected at a 
significant level with a longer integration. The deep V band image does not have large extended 
arcs, but it does show seven faint "blobs" and a thin arclct (Bernstein ct al. 1997). As noted 
above, at least two of the blobs are probably faint cluster galaxies. However, Blobs 2 and 3 and the 
arclet have been identified as possible lensed features, and lens models support this interpretation 
(Barkana et al. 1999; Bernstein &; Fischer 1999; Chae 1999). Since the V band corresponds to 
rest-frame UV at the quasar redshift Zg = 1.41, these features probably correspond to discrete star 
forming regions in the host galaxy. We follow previous models and include Blobs 2 and 3 plus two 
"Knots" in the arclet as model constraints (see §3.2). 

We estimated the fluxes of the host galaxy images using the IRAF task polyphot, which 
measures fluxes within polygonal apertures. Wc traced polygonal apertures along the edges of 
the distorted images of the host galaxy at the 3a level above the sky. We found that both lensed 
images of the host galaxy have H band brightnesses of ~18.4 mag, and a surface brightness of ^^20. 6 
mag/arcsec^ (both images have approximately the same area). The main source of uncertainty for 
the brightness estimates is the relatively large size of the residuals from the subtraction of the 
quasar images, compared to the (low) brightness levels of the arcs; we estimate the uncertainties 
to be ^0.3 mag. 

Given an acceptable lens model we can map the host galaxy images to the source plane to 
obtain a map of the unlensed host galaxy (see §3.3). In principle we could use the source maps 
to measure the photometric properties of the host galaxy. However, in practice the imperfect 
quasar subtraction corrupts the flux in the bright central regions of the source and hinders the 
measurement. One robust statement we can make is that part of the host galaxy is doubly-imaged 
like the quasar, but part of it is quadruply-imaged (see §4). This accounts for the shape differences 
between the A and B arcs: the A arc is a single distorted image of the host galaxy, while the B arc 
is a composite of three images that straddle the lensing critical line. 

3. Constraining models of Q 0957+561 

Q 0957+561 is the most thoroughly studied gravitational lens. The system comprises a radio- 
loud quasar at redshift Zg = 1.41 lensed into two images by a brightest cluster galaxy and its parent 
cluster at redshift zi = 0.36 (Walsh, Carswell &; Weymann 1979; Young et al. 1980). There is a time 
delay of 417 ± 3 days between the images (Schild &; Thomson 1995; Kundic et al. 1997; Haarsma 
et al. 1999). VLBI observations have resolved each image into a core and ~80 milli-arcsecond jet 
(Garrett et al. 1994). Deep optical images have uncovered faint "Blobs" and "Knots" that are 
probably lensed image pairs of star forming regions in the quasar host galaxy (Bernstein ct al. 
1997). Our observations have revealed infrared arcs representing distorted images of the quasar 
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host galaxy. In this section we discuss how to use these and other data to constrain models of the 
system and values for the Hubble constant Hq. In §3.1 we review the theory of lens modeling and 
discuss important model degeneracies. In §§3.2 and 3.3 we discuss previous and new observational 
constraints on models. In §§3.4 and 3.5 we describe classes of models applied to the lens galaxy 
and cluster in Q 0957+561, summarizing previous classes and introducing a new one. Finally, in 
§3.6 we discuss how we apply the observational constraints to our new class of models. 



3.1. Basic lens theory: model degeneracies 

In a multiply-imaged gravitational lens system, the light from a distant source is deflected by 
the gravitational potential of foreground objects so that we observe multiple images of the source. 
The lensing potential is usually dominated by a single galaxy, although there may be non-negligible 
perturbations from other objects nearby.^ Basic lens theory is presented in the book by Schneider, 
Ehlers & Falco (1992), and we quote the relevant results here. The lensing potential (p is determined 
by the two-dimensional Poisson equation V^t/) = 2k, where k = Y^/Ticrit is the surface mass density 
in units of the critical surface density for lensing (in angular units), 
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where Dot and Dqs are angular diameter distances from the observer to the lens and source, respec- 
tively, and Dig is the angular diameter distance from the lens to the source. The lensing potential 
deflects a light ray so that the angular position u of the source on the sky and the angular position 
X of an image are related by the lens equation, 

u = x- V(j){x) . (2) 

There is an image corresponding to each solution Xi of this equation. Lensing introduces a time 
delay between the ray paths of two images of the same source. The time delay between images at 
positions Xi and Xj is 
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where zi is the redshift of the lens. This equation is the basis of attempts to use lensing to determine 
the Hubble constant Hq. By measuring light curves of images one can determine the time delay 
Atij. A lens model gives the term in square brackets in eq. (|3|). The combination of distances is 
oc Hq^ and only weakly dependent on other cosmological parameters. 



^There may also be a contribution to the potential from density fluctuations along the line of sight, but it is usually 
small compared to the contribution from objects at the same redshift as the main lens galaxy (Seljak 1994; Barkana 
1996; Keeton, Kochanek & Seljak 1997). In a few cases there are two lens galaxies enclosed by the lensed images 
(e.g. Jackson, Nair & Browne 1997; Koopmans et al. 1999), but this complication is not an issue in Q 0957+561. 
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A lens model consists of a description of the lensing potential ^. The observed images provide 
two principle constraints on such a model. First, roughly speaking an image at projected distance 
R from the main lens galaxy measures the enclosed mass M{R). In Q 0957+561 and many 2-image 
gravitational lenses, the two images lie at different distances Ri / -R2) so they measure two masses 
M{R\) and M(i?2) and hence constrain the mass profile (e.g. Grogin & Narayan 1996). Second, 
with at least three well-determined positions (two images and the lens galaxy, or four images) , the 
images determine the quadrupole moment of the net potential. However, there are four properties 
of the mass distribution that must be determined for a complete description of the model: the mass 
profile of the main lens galaxy; the shape (ellipticity and orientation) of the main lens galaxy; the 
shear from the the gravitational tidal field induced by objects near the lens galaxy; and the amount 
of gravitational focusing (or "convergence" ) contributed by the environment of the lens galaxy. 

With more unknown quantities than constraints, there are two common degeneracies in the 
lens models. First, to lowest order the convergence n contributed by the environment cannot be 
determined by lens models, which leads to the so-called "mass sheet degeneracy" (Falco et al. 1985). 
If a lensing potential (f){x) fits the observed data, then any potential 

(^'{x) = ^k\x\'^ + K)<t){x) (4) 

will fit the data equally well. The n term is equivalent to the potential from a uniform mass sheet 
with surface density E = k Scrit- The only potentially observable difference between the lens models 
represented by and ^' is in the predicted time delays, 

^'t'model = (!-«) ^tmodel ■ (5) 

However, this effect is not observable if we want to use gravitational lensing to determine the Hubble 
constant because it simply translates into a scaling of the inferred value for Hq. If Hq and H'q are 
the values inferred from the two lens models, then 

H'^ = {1-k)H^. (6) 

This analysis has used only the lowest order term of the contribution from the environment, but the 
higher order terms cannot necessarily eliminate the degeneracy (e.g. Chae 1999). The mass sheet 

degeneracy is important for Q 0957+561 because the parent cluster of the lens galaxy contributes 
a significant convergence. The only way to break this degeneracy is to obtain an independent mass 
constraint to determine the relative contributions of the main lens galaxy and the environment to 
the mass enclosed by the images (see §3.2). 

The second important degeneracy is between the shape of the main lens galaxy and the shear 
from the gravitational tidal field of objects in the environment of the lens galaxy. Because the 
images determine only the quadrupole moment of the total potential, there can be a wide range of 
parameter space in which the lens galaxy ellipticity and the external shear combine to produce the 
required joint quadrupole (e.g. Keeton et al. 1997). Consider a 2-image lens with image positions 
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XA and xb, a galaxy potential <j)gai, and an external shear with amplitude 7 and direction 6^. 
Fitting the image positions exactly is equivalent to solving the equation 

XA - V(j)gal{xA) - T ■ XA = XB - V(j)gal{xB) - T ■ Xb (7) 

where the shear is described by the tensor 

^ ^ 7 cos 20^ 7 sin 20^ 
7 sin 20^ —7 cos 20^ 

No matter what galaxy potential (j)gai is used, it is straightforward to solve eq. for the shear 
parameters 7 and 0y. In other words, one can fit the image positions in a 2-image lens to arbitrary 
precision for any model lens galaxy, although in some cases the inferred shear 7 may be unphysically 
strong. 

Constraints from the images fluxes or additional images often break this degeneracy, although 
the la range of models may still be large. In previous models of Q 0957+561, the additional 
constraints came primarily from the VLBI observations of the quasar images, which show several 
discrete components with ~0.1 milli-arcsecond errorbars (Garrett et al. 1994; Barkana et al. 1999). 
It is easy to be misled about the true structure of the potential, however, when using such strong 
constraints (see Kochanek 1991; Bernstein et al. 1993; Bernstein &: Fischer 1999). Internal structure 
in the lens galaxy, such as isophote twists or lumpiness in the mass distribution, may affect the 
images at sub-milli-arcsecond scales (e.g. Mao &: Schneider 1998). Neglecting that structure forces 
the models to adjust large-scale model components in order to fit small-scale constraints, leading 
them to converge to a best-fit solution that is well defined (i.e. with no apparent degeneracy) but 
incorrect. In Q 0957+561, the observed ellipticity gradient and isophote twist suggest that the 
galaxy does have important internal structure (Bernstein & Fischer 1997), and in §4 we show that 
models neglecting this structure indeed converged to incorrect solutions. 



3.2. Previous observational constraints 

In Q 0957+561 the high-resolution VLBI maps of the quasar images provide strong position 
constraints, plus somewhat weaker constraints on the relative magnification matrix between the 
images (Garrett et al. 1994; Barkana et al. 1999). The optical Blobs and Knots offer additional but 
weaker position constraints. The Knots are particularly useful because they appear to represent a 
pair of "fold" images and thus require the lensing critical line to pass between them (see Bernstein 
& Fischer 1999). The constraints from these images leave two of the three model degeneracies 
discussed in §3.1: the mass sheet degeneracy due to the convergence provided by the cluster; and 
the degeneracy between the lens galaxy shape and the cluster shear. 

Several steps have been taken to break the mass sheet degeneracy by determining the relative 
masses of the lens galaxy and cluster. First, Rhee (1991), Falco et al. (1997), and Tonry & Franx 
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(1999) measured the lens galaxy's central velocity dispersion to be 288 ± 9 km s~^. Romanowsky 
& Kochanek (1999) used stellar dynamical models to translate this into constraints on the lens 
galaxy mass and concluded that (80 ib 12)% of the image separation is contributed by the galaxy 
(and the rest by the cluster convergence). Second, Fischer et al. (1997) detected a weak lensing 
signal from the cluster, which Bernstein k. Fischer (1999) translated into an estimate of the mean 
surface density of all mass inside an aperture of radius 30" centered on the lens galaxy, 

{i^ohs)^^'! = (S)3Q///Ecrit = 0.26 ± 0.08 . (9) 

Furthermore, Bernstein & Fischer (1999) showed that the surface mass density KcIus — ^clus/^crit 
of the cluster at the position of the lens galaxy is given by 

where {nmod) is the mean mass density of a model lens galaxy. In this way one can combine the 
weak lensing measurement with a lens model to estimate Kdus (see §5.3). There have been other 
attempts to measure the cluster mass, using the velocity dispersion of the galaxies in the cluster 
(715±130 km 8"^ Garrett, Walsh & Carswell 1992; An gonin-Willaime, Soucail & Vanderriest 1994) 
or the X-rays from the cluster gas (Chartas et al. 1998), but they are still limited by significant 
systematic uncertainties. 

Although measurements of the cluster surface density are important (especially for determining 
Hq, see §5.4), they do not break the degeneracy between the lens galaxy shape and the cluster shear. 
Another approach is to constrain the shape of the lens galaxy's mass distribution. The orientation of 
the lens galaxy's light distribution should offer a guide to the orientation of its mass distribution, 
because other lenses suggest that mass and light are at least roughly aligned (typically within 
^^10°; Keeton, Kochanek & Falco 1998). An important detail, though, is that the lens galaxy has 
a radially varying ellipticity and orientation (Bernstein et al. 1997): the cllipticity increases from 
~0.1 inside a radius of 1" to ~0.4 outside 10", and the PA varies from ~40° to ~60° (albeit with 
large uncertainties). These variations suggest that the galaxy's projected mass distribution may 
not have simple elliptical symmetry, and they may well need to be incorporated into lens models. 



3.3. New constraints from the host galaxy arcs 

The host galaxy arcs act as an extensive set of position constraints that may break the degener- 
acy between the lens galaxy shape and the cluster shear. Although the intrinsic source structure is 
unknown, the simplicity of the lensing geometry means that we can model the arcs self-consistently 
using a modified version of the Ring Cycle algorithm of Kochanek et al. (1989). Specifically, we 
know that there is a one-to-one mapping between the A image of the host galaxy and the source, 
so for a given lens model we can project the A arc back to the source plane to obtain the correct 
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source structure for that lens model.^ We can then reproject that source onto the image plane to 
predict the structure of the B arc. Since the arcs are very large compared to the size of the PSF, 
there is no need to include the effects of the PSF. For the arc map we use the smoothed residual 
image (Figure Ic). 

This technique produces maps of the model arcs that can be compared visually to the observed 
arcs (see Figures 2-4 below). However, for modeling we want to quantify the differences, so we 
define a term for the arcs using a pixel-by-pixel comparison of the observed and model surface 
brightnesses, 

Narc / f f \2 

2 \Jobs,i Jmod,i) 

Xarc ~ 2-^ 0Z2 ■ ^ 

The factor of 2 in the denominator enters because the source is constructed from arc A, so it and 
the observed B arc each have errorbars of c^gj^y. To compute Xarc mask the arc map and use 
only the region near the B arc, omitting regions near the B quasar and the core of the lens galaxy 
where imperfect subtraction makes the residual flux unreliable. We also omit the region around 
the object G4 where the flux is not due to the quasar host galaxy (see §2). The masked region is 
shown in Figure Ic. To count the number of arc constraints, we count the number of pixels inside 
the mask for which either the observed arc or the model arc is more than 2a above the sky and 
take this to be the number of arc constraints Narc- Typically Narc ~ 10^. Because the source is 
fully determined from the A arc, we quote Xarc = xi.rc/^arc as an effective x^/DOF for the arc. 
The fact that our best models produce Xarc — 1 (see §5) suggests that this is a reasonable counting 
of the constraints. 



3.4. Lens galaxy models 

Q 0957-1-561 has been modeled extensively; we summarize results from the recent models by 
Grogin & Narayan (1996), Barkana et al. (1999), Bernstein & Fischer (1999), and Chae (1999) in 
Table 3. (Technical details of comparing the models are discussed in Appendix A.) The models 
fall into two main families based on how they treat the main lens galaxy. First is the "power 
law" family, in which the lens galaxy is modeled with a circular or elliptical surface density with a 
softened power law profile: E oc (s^ -|- m?)'^^'^ ^, where m is an ellipsoidal coordinate, s is a core 
radius, and a is the power law exponent such that M{R) ~ asymptotically. The best-fit models 
typically have a small core radius and a power law a ~ 1.1 corresponding to a profile slightly 
shallower than isothermal. In the second family, called "FGS" models after being introduced by 
Falco, Gorenstein & Shapiro (1991), the lens galaxy is treated as a circular or elliptical King model, 
plus a central point mass to account for a mass deficit in the core of the King model. 



similar projection fails for the B arc because in all reasonable models this arc crosses the lensing critical line 
(see §4). Thus the map from the B arc to the source is not one-to-one. 
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in almost all of the previous models the lens galaxy was assumed to have a projected mass 
distribution with circular or elliptical symmetry. However, as discussed in §§3.1 and 3.2 the observed 
ellipticity gradient and isophote twist suggest that this assumption may not be correct. The key 
improvement in the models is the addition of internal structure in the lens galaxy's mass comparable 
to that seen in its light. While we do not expect the mass to trace the light exactly, it should be 
given the same freedoms in order to avoid oversimplifying the models. 

Bernstein k. Fischer (1999) did add freedom to the galaxy's radial and angular structure by 
using independent power laws in different radial zones, but this led to unphysical models with 
density discontinuities that cannot match normal rotation curves. We introduce models that allow 
similar radial and angular freedom while keeping the density smooth. We start with the pseudo- 
Jaffe ellipsoid, whose projected density distribution has elliptical symmetry and a profile that is 
roughly flat inside a core radius s, falls as S oc out to a cut-off radius a (yielding a rotation 
curve that is approximately flat), and then falls as S oc to maintain a finite mass; a detailed 
definition is given in Appendix B. We then construct "double pseudo-Jaffe" models comprising 
two concentric pseudo-Jaffe components with different scale lengths, ellipticities, and orientations. 
We fix the model galaxy to its observed position, which leaves 10 galaxy parameters: a mass 
parameter (6j), ellipticity (e^), orientation angle (PAj), core radius (sj), and cut-off radius (oj) for 
each component {i = 1,2). 

The double pseudo-Jaffe model provides a great deal of freedom in the lens galaxy. First, 
it is essentially a smooth generalization of both the power law and the FGS models. When the 
inner pseudo-Jaffe component is compact, it mimics the point mass in FGS models while the outer 
component is similar to the King model (see Appendix B); and when the cut-off radius of the inner 
component is comparable to the core radius of the outer component, the two components combine 
to produce nearly a smooth a = 1 power law. Second, with this model we can mimic the observed 
internal structure of the lens galaxy: by adjusting the ellipticities and orientations we can produce a 
smooth ellipticity gradient and isophote twist. An important difference from the broken power law 
models of Bernstein k. Fischer (1999) is that the double pseudo-Jaffe models have smooth density 
profiles. We discuss the physical properties of sample double pseudo-Jaffe models in §5.3. 

3.5. Cluster models 

Previous models have used several different methods for including the cluster's contribution to 
the lensing potential. The simplest approach is to expand the cluster potential in a Taylor scries 
and keep only the lowest significant (2nd order) term, which describes the tidal shear produced by 
the cluster. However, this approximation is thought to be poor for Q 0957-1-561 because the cluster 
is close to the lens and the VLBI errorbars are small, so the 3rd order terms are larger than the 
errorbars. Barkana et al. (1999) and Chae (1999) included all the higher order terms implicitly 
by introducing a mass distribution to represent the cluster, but degeneracies related to the cluster 
shape and density profile forced them to make assumptions about the cluster properties. Kochanek 
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(1991) and Bernstein &; Fischer (1999) instead used a Taylor series with general 3rd order terms, 

(pclus = \ Hclus r^ + \ir^ cos 2{d -Q^) + -^ar^ sin(^ -Qa)-\^r^ sin - 63) . (12) 

The 2nd order Kdus term produces the mass sheet degeneracy, so it is usually omitted from the 
lens models and constrained independently (sec §§3.1 and 3.2). The 2nd order 7 term represents 
the tidal shear from the cluster. The 3rd order a and 5 terms arise from the gradient of the cluster 
density and the m = 3 component of the cluster mass (in a frame centered on the lens galaxy; see 
Bernstein &: Fischer 1999; Keeton 2000). The three direction angles {6^,9(^,9$) are written here as 
position angles (i.e. measured East of North). 

We follow Bernstein &: Fischer (1999) and use the 3rd order Taylor series (omitting the Kdus 
term). Although we simply fit for the Taylor series parameters, it is important to understand 
how they relate to physical properties of the cluster and what their reasonable ranges are. In 
popular cluster mass models the cluster amplitudes (7,(7, (5) and direction angles {0j,9a-,9s) have 
the following properties (Keeton 2000): 

• Circularly symmetric mass distribution: the three angles all point to the center of the cluster. 

• Singular isothermal ellipsoid: the shear angle 9^ points to the cluster center and the shear 
amplitude equals the convergence from the cluster, 7 = Kdus- Both of these results hold 
for all positions and all values of the axis ratio q. The gradient amplitude a is bounded by 
1 < < where ro is the distance from the lens galaxy to the cluster. 

•^clus ■^y 

• Softened isothermal ellipsoid: Outside of the core the results for the singular isothermal 
ellipsoid are approximately true. For example, for clusters with an axis ratio q > 0.6 the 
shear angle 9^ points to within 10° of the cluster center for rg ^ 2s, where s is the core 
radius. The cluster has Kdus > 7; but Kdus > 27 only for ro ^ 3s. 

• An ellipsoid with the "universal" dark matter profile of Navarro, Frenk &: White (1996): For 
clusters with an axis ratio q> 0.6, the shear angle 9j points to within 4° of the cluster center 
for ro ^ 0.5rs, where rg is the scale radius in the "universal" profile. The cluster generally 
has Kdus 7 for ''o k, ^s and Kdus ^ 7 for ro^rg. 

We let the shear and gradient parameters (7, 9^, a, 9^) vary freely, but we keep these general relations 
in mind when interpreting the results. To avoid an explosion of parameters, we follow Bernstein 
k, Fischer (1999) and use a "restricted" 3rd order cluster with 9^ = 9^ and 5 = —3(7/2 (as for a 
singular isothermal sphere). This model is useful because it requires only four parameters for the 
cluster, and it can provide model- independent evidence for a non-circular cluster if 9a ^ 9^. Wc 
also experimented with models using a singular isothermal mass distribution for the cluster, and 
we discuss these models briefly but do not quote detailed results (§5). 
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3.6. Constraining the new models 

Our models have 10 galaxy and 4 cluster parameters, but fortunately we need not examine all 
of them explicitly. First, the general features of the lens allow us to fix two scale radii. The lack of 
a central image requires that the galaxy be nearly singular, so we set the core radius of the inner 
pseudo-Jaffe component (si) to zero. Also, previous lens models suggest that the galaxy needs to 
have mass extending at least to the distant A image (5" from the galaxy), so the cut-off radius of 
the outer pseudo-Jaffe component (02) should be larger than 5". The model should not be very 
sensitive to any particular value larger than this, so we fix 02 = 30" (similar to Bernstein & Fischer 
1999). 

Second, we can use linear techniques for the two galaxy mass parameters (61 and 62) and 
the two cluster amplitudes (7 and a) because they enter the potential as simple multiplicative 
factors. (This is a variant of solving for the shear parameters in eq. |^.) If we take the strong 
VLBI constraints on the quasar core and jet positions to be exact (see Kochanek 1991; Bernstein 
& Fischer 1999), they lead to four linear constraint equations: 

cores: xai - V</>(xai) = xbi - V(/)(fBi) (13) 
jets : f A5 - V(p{xA5) = XB5 - V(/)(fB5) (14) 

where Ai and Bi denote the two quasar cores while A5 and B5 denote the brightest jet components; 
the positions are given by Barkana et al. (1999). The four constraint equations can be solved 
explicitly to write the four linear parameters as functions of the remaining (non-linear) parameters. 
This technique ensures that the VLBI constraints are fitted exactly, while reducing by four the 
number of parameters that must be examined directly. 

We are left with 8 explicit parameters: the ellipticities ei and 62, orientation angles PAi and 
PA2, and scale lengths ai and S2 of the galaxy; and the direction angles 9^ and 6u of the cluster. 
We evaluate models in this parameter space using the remaining constraints: the flux ratios of the 
quasar cores and jets and the positions of the optical Blobs and Knots (taken from Bernstein &: 
Fischer 1999), and the structure of the arcs (see §3.3). Techniques and results for our new models 
are discussed in §5. 

4. The failure of existing lens models 

Previous models of Q 0957-1-561 (summarized in Table 3) posited that the VLBI observations 
and the optical Blobs and Knots provided strong enough constraints to break the degeneracy 
between the lens galaxy shape and the cluster shear. As we discussed in §3.1, however, it is 
dangerous to use smooth circular or elliptical lens models with high-precision constraints, because 
the oversimplified models may be forced to converge on a best-fit solution with the wrong global 
structure for the potential. Until now we have lacked the constraints to test this concern, although 
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the fact that previous models gave wildly difference shapes for the lens galaxy certainly suggests 
that the models were not robust. 

The host galaxy arcs finally provide an extensive set of constraints for testing the previous 
models. The first clue comes from comparing the qualitative features of the arcs and the existing 
models. Despite differences in details, most of the previous models have a Icnsing potential with 
a strong tidal shear from the cluster (7 ~ 0.1-0.4). The galaxy dominates the potential for the 
close image (B), but the strong shear means that the cluster dominates the potential for the distant 
image (A). The distinction is important because in general a round host galaxy produces an image 
distorted tangentially relative to the mass that dominates the potential. As a result, for a circular 
host galaxy most previous models of Q 0957+561 would predict a B arc tangential to the galaxy 
(as observed), and an A arc tangential to the cluster - i.e. stretched radially relative to the lens 
galaxy, opposite what is observed. These lens models can produce an A image tangential to the 
galaxy only if the source galaxy is highly flattened and oriented at just the right angle. In other 
words, we must either invoke a very special source configuration or conclude that the standard 
Q 0957+561 models have generic problems. 

We can quantify these problems by using the arc modeling technique described in §3.3 to 
predict the structure of the B arc for each model and compare it to the observed B arc. (Recall 

that this technique uses the observed A arc to construct the source, so models always reproduce 
the A arc correctly and hence are evaluated by examining the B arc.) Figures 2 and 3 show the 
observed and predicted models arcs for previous power law models, while Figure 4 shows the arcs 
for FGS models; Figure 5 shows the intrinsic source inferred for two of the models. Table 3 includes 
the quantitative Xarc estimates for the models. 

All of these models are inconsistent with the structure of the B arc, both visually and in 
terms of Xarc- Although the predicted arcs differ among the various models, there are two common 
features. First, in some of the models (notably those with a circular lens galaxy) the predicted B 
arc has a pair of bright ridges not seen in the observed B arc. Models that produce such ridges 
are strongly inconsistent with the data; they have Xarc statistics no better than 5.0 and as poor 
as 10.8. Second, the models generally fail to predict the correct shape for the B arc at low surface 
brightness levels. The predicted arcs have too much curvature and do not match the northeast 
extension of the observed arc. The failure to match even the rough shape (extent and curvature) 
of the B arc illustrates the point explained above: if the potential near the A image is dominated 
by the cluster shear, reproducing the A arc requires a flattened source (e.g. the Barkana/SPEMD 
model in Figure 5) that is distorted into a highly curved B arc. The existing models cannot match 
the arcs better than Xarc ~ ^-2) which given the large number of arc constraints means that these 
models are formally excluded at an extremely high significance level. 

The models arcs also illustrate a crucial limitation of the broken power law models of Bernstein 
Sz Fischer (1999), represented here by the DM2+C2 model in Figure 3. These models use indepen- 
dent elliptical power law models in different radial zones to mimic the ellipticity and orientation 
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variations in the observed lens galaxy. The problem is that the density is discontinuous across 
the zone boundary, which leads to a discontinuity in the predicted B arc. The discontinuity does 
not affect the quantitative Xarc statistic because it occurs outside our mask, but it is qualitatively 
unacceptable. Combining this with the inability of the broken power law models to match normal 
rotation curves emphasizes that the models are physically unacceptable. The need for a more phys- 
ically reasonable way to include ellipticity and orientation variations motivates our introduction of 
the double pseudo-Jaffe models. 

Although the previous models do not correctly reproduce the distortions of the host galaxy, 
they do reveal a model-independent qualitative feature of the lensing. Part of the quasar host 
galaxy is doubly-imaged like the quasar, but a small region crosses the lensing caustic and is 
quadruply-imaged (see Figure 5). Arc A is a single distorted image of the host galaxy, but arc B 
is a combination of the three remaining images of the quadruply-imaged and the one remaining 
image of the doubly-imaged region. This fact explains why the A and B arcs have such different 
geometries. It also implies that with the current observations of the arcs, most of the constraints 
come from the small region of the host galaxy that is quadruply-imaged. A deeper image of the 
arcs should reveal a complete Einstein ring that would strengthen the constraints by using more of 
the host galaxy (see §6). 

In summary, the host galaxy arcs in Q 0957+561 offer a strong new probe of the global shape 
of the lensing potential, and they show that existing lens models converged to the wrong potential. 
Generically, the previous models have too much shear from the cluster. The failure to fit the arcs is a 
common problem of models that combine strong constraints from the sub-milli-arcsecond structure 
of the quasar jets with oversimplified circular or elliptical lens models. It rules out existing models 
of the system, together with the bounds on Ho drawn from them. 

5. Successful new models 

Two goals motivate our search for new models of Q 0957-1-561. First, if we want to use the 
VLBI constraints with sub-milli-arcsecond precision, we must be sensitive to details of the lens 
galaxy structure such as its radially varying ellipticity and orientation. We used double pseudo- 
Jaffe models (see §3.4) to incorporate such structure in a smooth and physically reasonable way. 
Second, we want to incorporate the powerful constraints from the host galaxy arcs into the modeling 
process, instead of using them for a posteriori tests as in §4. In this way we hope to break the 
environment degeneracy in a robust way. When we began to examine new models we found a range 
of solutions consistent with the data, so we adopted Monte Carlo techniques to sample this range. 
In this section we first describe the Monte Carlo techniques (§5.1), and then discuss the models and 
their implications for breaking the degeneracy between the galaxy and cluster (§5.2), the physical 
properties of the lens galaxy and cluster (§5.3), and the Hubble constant (§5.4). 
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5.1. Monte Carlo techniques 

To explore our model space, we picked random values for the non-linear model parameters, 
using restricted but physically-motivated ranges: 

• Galaxy ellipticity: There are no successful models with an outer ellipticity below 0.3. We 

believed models with an ellipticity larger than 0.7 to be implausible. Hence we considered 
0.3 < 62 < 0.7 for the outer ellipticity. In models with an ellipticity gradient the inner 
ellipticity can be small, so we considered < ei < 0.7. 

• Galaxy orientation: In the observed lens galaxy the PA varies from ~40° to ~60° (Bernstein 
et al. 1997). We expect the mass to be roughly aligned with the light (to within ~10°; Keeton 
et al. 1998), so we considered 30° < (PAi, PA2) < 70° for the inner and outer position angles. 

• Galaxy scale lengths: We fixed the inner pseudo-Jaffe component to be singular (si = 0) 
and the outer component to have a cut-off radius a2 = 30" (sec §3.6). For the cut-off radius 
of the inner component (ai) and the core radius of the outer component (52) we considered 
O^'l < (ai,S2) < 4". 

• Cluster direction angles: Galaxy counts and the weak lensing measurement suggest that the 
cluster has its mass concentration toward 9dus ~ 55° (Fischer et al. 1997). For reasonable 
cluster models the shear angle 9^ points roughly to the cluster center (see §3.5), so we con- 
sidered shear angles in the range 30° < O-y < 70°. We were less restrictive with the gradient 
angle 0^ and required only that it be in the same quadrant as the cluster, 0° <d^ < 90°. 

Given values for these parameters, we fixed the four remaining parameters (61, 62, 7, a) using the 
constraints from the quasar core and jet positions (see §3.6). Our limited parameter ranges may 
omit some models that are formally consistent with the data, but they span what we expect for 
physically reasonable models. 

We tabulated models that fit the data at the 95% confidence level. With N = (2, 4, 6) con- 
straints this is equivalent to < (5.99,9.49,12.59) (e.g. Press et al. 1992, §§6.2 and 15.6). We 
applied these thresholds to the constraints from the quasars and the optical Blobs and Knots both 
separately and jointly: we required Xjiux — ^-^^ ^^'^ ratios of the quasar cores and jets 

(2 constraints), Xbk — ^-49 for the positions of the optical Blobs and Knots (4 constraints), and 
Xfiux + Xbk — 12.59 for the 6 joint constraints. We did not impose constraints from the less reliable 
Blob and Knot flux ratios. We considered different thresholds for the arc constraints, as discussed 
below. Note that it was not useful to combine all the constraints into a total x^/DOF because the 
number of constraints from the arcs is so large. 

The Monte Carlo technique is not especially efficient; we examined ~ 10^ models and found 
of order 1000 models consistent with the data. However, examining that many models is not 
prohibitively time consuming. Most of the models can be ruled out quickly because they fail the 
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flux and Blob/Knot cuts. For the remaining models, we first computed a fast Xa^c using a 4 
times undersampled arc map; only for promising models did we compute xirc using the full arc 
map. The benefit of this brute force approach is the ability to identify and sample the wide range 
of models consistent with the data, and thereby estimate the full range of possible Hq values. 

We first considered models in which the inner and outer components of the model galaxy 
were fixed to have the same ellipticity and orientation; we found (3, 275, 832, 1575) models with 
xirc < (1.2,1.3,1.4,1.5), whose properties are summarized in Figure 6. Next we allowed the 
inner and outer galaxy components to have different shapes, mimicking ellipticity and orientation 
gradients. For simplicity, we refer to such models as having a "twist," even though that term 
formally describes only an orientation gradient. We found (25, 134, 286, 477) twist models with 
Xarc < (1-2, 1.3, 1.4, 1.5), whose properties are summarized in Figure 7. Table 4 gives parameters 
and Figure 8 shows the predicted arcs for sample models of both types. We note that the number 
of successful models with a twist is smaller than the number of successful models without a twist. 
Introducing the twist enlarges the parameter space without necessarily increasing the space of 
successful models by the same amount; hence sampling the same total number of models yields 
fewer acceptable models. In the following sections we discuss models with Xarc ^ut our 

conclusions would not change substantially if we lowered this threshold. 



5.2. Breaking the galaxy/cluster degeneracy 

The host galaxy arcs finally break the degeneracy between the lens galaxy shape and the cluster 
shear in Q 0957+561. First, they constrain the lens galaxy shape. Among the previous models, 
the three with the best fits to the arcs have 62° < PA < 67° (Table 3). In the new models, the 
bounds on the (outer) orientation of the lens galaxy are 55° < PA < 66° in models without a twist 
(Figure 6d) and 52° < PA2 < 68° in models with a twist (Figure 7f). These ranges are consistent 
with the observed galaxy's orientation of about 56 + 8° (Bernstein et al. 1997). In other words, the 
arcs not only constrain the model galaxy's orientation, they require that its mass distribution be at 
least roughly aligned with the light distribution; such alignment is also seen in other lens systems 
(Keeton et al. 1998). Also, in the new models the bounds on the (outer) ellipticity of the galaxy 
are 0.38 < e < 0.61 in models without a twist (Figure 6c) and 0.35 < 62 < 0.63 in models with a 
twist (Figure 7e). These ranges are slightly higher than the ellipticity seen in the outer parts of the 
galaxy (Bernstein et al. 1997), but there is no a priori reason to expect the ellipticities of the dark 
halo and the light to be similar. These constraints on the lens galaxy shape span models with a 
wide range of lens galaxy mass profiles (controlled by the scale radii ai and S2; see Figures 6b and 
7b), ellipticity profiles (with or without a twist), and cluster contributions. Thus we believe that 
they are robust, and that the ability of the host galaxy arcs to finally constrain the galaxy shape 
is the most important result of our new models. 

Second, the arcs place limits on the lensing contribution of the cluster. The new models have 
moderate or even small cluster shears (typically 7 ^ 0.05; Figures 6e and 7g), which contrasts 
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with the strong shears in previous models. A small shear is required by the fact that both arcs 
are tangential to the lens galaxy (see §4). Also, including the 3rd order cluster term is important, 
as expected from the proximity of the cluster to the lens (Kochanek 1991). Most models have a 
cluster gradient amplitude in the range 0.006 < a < 0.016 (Figures 6g and 7i), so at the position of 



image A the 3rd order term is comparable to or even larger than the 2nd order term (see eq. 12). 
Because the cluster amplitudes are small, the corresponding direction angles 9^ and 9^ are poorly 
constrained (Figures 6f, 6h, 7h, and 7j). Nevertheless, the fact that they usually differ implies that 
the cluster cannot be spherical (see §3.5). 

Unfortunately, even the arcs cannot uniquely determine the model. First, the cluster angles 
can adjust to accommodate the narrow but finite range of lens galaxies given above. Second, in 
models with a twist the inner component can take on all values of ellipticity and orientation in the 
ranges we allowed (Figures 7c and 7d). This is because the inner component is usually compact, 
so like the point mass in FGS lens models it serves mainly to correct a central mass deficit and the 
model is largely insensitive to the distribution of that mass. Both of these degeneracies affect the 
potential enough to produce a disappointingly wide range of Hubble constant values (see §5.4). 

These conclusions are drawn from models using a Taylor series for the cluster potential. We 
also examined models treating the cluster using an elongated isothermal model with an arbitrary 
position, axis ratio, and orientation.^ The above conclusions about the galaxy properties still 
pertain, with one modification: in models with a twist the distribution of outer orientation angles 
gains a tail down to PA2 ~ 40°. The above conclusions about the cluster contributions also hold, 
except that the new bounds on the shear amplitude are 0.03 < 7 < 0.11. Since an isothermal 
cluster has 7 Kdus^ it is hard to obtain a small shear without essentially eliminating the cluster. 



5.3. Properties of the galaxy and cluster 

We introduced the double pseudo-Jaffe models in order to smoothly include ellipticity and 
orientation variations in the lens galaxy, but we must ask whether the models are physically plau- 
sible. Figure 9 shows the estimated density and circular velocity profiles for the models in Table 
4. These profiles are only estimates because they require the full 3-d mass distribution, while 
lens models give only the projected distribution. We assumed that the intrinsic distributions are 
oblate spheroids viewed edge-on, and we computed the density and circular velocity profiles in the 
equatorial plane, neglecting any twist. The high inferred circular velocities {vc ~ 600 km s^^) are 
misleading, because our models omit any convergence from the cluster and Vc oc {1 — KdusY^"^ ■, and 
because converting to an observed velocity dispersion involves systematic uncertainties in the stellar 
dynamics (see Romanowsky & Kochanek 1999). The qualitative features, however, are plausible. 
The inner and outer galaxy components generally combine to produce a rotation curve that is ap- 



^For technical reasons, the isothermal cluster models we used had only approximate elliptical symmetry. 
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proximately flat, although in several models it turns up at small radii because the inner component 
of the pseudo-JafFe models is compact. This effect, which also appears in FGS models because 
of the point mass, occurs because lensing constrains only the total mass enclosed by the B image 
(rs = I'-'O), and if that mass is compact then it produces a rising rotation curve. We conclude that 
the fitted double pseudo-Jaffe models are fairly reasonable. 

As for the cluster, recall that the host galaxy arcs require the shear to be small. At the same 
time, two independent estimates of the cluster mass suggest that the convergence from the cluster 
is Kcius ~ 0.2. First, stellar dynamical modeling of the lens galaxy constrains its mass such that 
explaining the image separation requires a cluster convergence of Kdus = 0.20 it 0.12 (Romanowsky 
& Kochanek 1999). Second, Fischer et al. (1997) and Bernstein & Fischer (1999) used weak lensing 
to measure the total mass inside an aperture centered on the lens galaxy. Subtracting the mass 
of a model lens galaxy then gives a model-dependent estimate of the remaining cluster mass (see 



eq. 10). With this technique all of our models yield 0.20 ^ f^cius 

< 0.23. 



These results are surprising because with popular cluster models it is difficult to produce a shear 
that is much smaller than the convergence (Keeton 2000). Specifically, with a singular isothermal 
ellipsoid, 7 = Kdus for all cluster positions, orientations, and axis ratios. Introducing a core radius 
allows 7 ^ Kdus ) but the shear is substantially smaller than the convergence only if the lens galaxy 
is near or within the cluster core. If the cluster halo has the "universal" or NFW density profile 
(Navarro et al. 1996), 7 ^ Kdus/'^ only inside ~0.5rs, where is the NFW scale length. For the 
observed cluster at redshift zi = 0.36 with a velocity dispersion of ~700 km s~^ (Angonin-Willaime 
et al. 1994), the NFW scale length would be ~ 100 h'^ kpc or ~30" (e.g. Navarro et al. 1996). 
Combining these results with the estimates of the cluster contribution (7 ^ 0.1 or even <^ 0.05, and 
I'^cius ~ 0.2) suggests two possible conclusions about the cluster. On the one hand, if the cluster is 
ellipsoidal it must have a core or scale radius large enough to encompass the galaxy. Having the 
lens galaxy be in the cluster core would not be too surprising since it is the brightest galaxy in 
the cluster. On the other hand, the assumption of an ellipsoidal model for the cluster may not be 
correct, perhaps because of substructure. 



5.4. Implications for the Hubble constant 

Finally, we want to combine the lens models and the observed time delay and use eq. (|^) to 
determine the Hubble constant Hq. As discussed in §3.1, the inferred value of Hq depends weakly 
on the other cosmological parameters; we quote results assuming f^o = 1 and Aq = and note that 
they would increase by 5.8% (4.5%) for an 0.^ = 0.3 open (flat) cosmology. Also, the mass sheet 
degeneracy implies Hq cx {y—Kdus) (see eq. |6|), and Kdus cannot be constrained by lens models. Thus 
if we write Hq = 100 /i km s~^ Mpc~^, lensing directly measures only the combination hj { \ — Kdus\ 
which is what we quote in Tables 3 and 4 and Figures 6 and 7. We need an independent estimate 
of Kdus in order to constrain Hq itself. 
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We saw in §5.2 that the host galaxy arcs help break the degeneracy between the lens galaxy 

shape and the cluster shear, but they do not eliminate it entirely. The remaining freedom in the 
models is small in terms of the properties of the lens galaxy but large in terms of the Hubble 
constant. Models without a twist yield 1.1 ^ /i/(l — Kdus) ^ 1-4, with a tail down to h/(l — 
i^clus) ~ 0-95 (Figure 6). Allowing a twist in the lens galaxy broadens the distribution down 
to h/{l — Kclus) ~ 0-85 (Figure 7). In other words, within the lens models there is a ±25% 
variation in inferred values for the Hubble constant; this is independent of uncertainties in Kdus- 
Most of this variation is related to a strong correlation between the lens galaxy ellipticity and the 
Hubble constant (Figures 6c, 7c, and 7e): flatter galaxies have deeper central potential wells when 
normalized to produce the same images. 

Although the model uncertainties limit our ability to use Q 0957+561 to constrain the Hubble 
constant, we might invert our thinking and ask what properties of the lens model are required to 
be consistent with local distance ladder determinations of Hq (e.g. Mould et al. 1999). Assuming 
a cluster convergence Kdus ~ 0-2 (see §5.3), obtaining Hq ^ 80 km s~^ Mpc~^ requires a lens 
galaxy that is relatively round in the center (ei ^ 0.30) and moderately flattened in the outer parts 
(0.35 ^ 62 ^ 0.56). These bounds bolster the suggestion that the lens galaxy must have a mass 
distribution with an ellipticity gradient fairly similar to that seen in the light distribution. 

6. Conclusions 

We have detected large, distorted images of the quasar host galaxy in the gravitational lens 
Q 0957+561. In the H band (rest frame R), the host galaxy appears as two long (~5") arcs stretched 
tangentially relative to the lens galaxy. Previously published models of Q 0957+561 fail to predict 
the correct shape for the host galaxy arcs. This failure rules out those models and any conclusions 
about the value of the Hubble constant drawn from them. 

The problem with the previous models is that they oversimplified the mass distribution for 
the lens galaxy, so they had to adjust the large-scale features of the model (the galaxy shape and 
the cluster shear) in order to fit small-scale constraints (the VLBI jets, with ^^0.1 milli-arcsecond 
errorbars). Without strong constraints on the global shape of the lensing potential, each class of 
models happily converged to a best-fit solution with the wrong global structure. The failure of 
these models presents two important lessons (also see Kochanek 1991; Bernstein et al. 1993; Mao Sz 
Schneider 1998; Bernstein & Fischer 1999). First, when the constraints are extremely precise it is 
necessary to include the full complexity of the structure of the lens galaxy. Second, it is important 
to carefully explore the full range of possible models before leaping to conclusions about the value 
of the Hubble constant and its uncertainties. 

Two improvements are crucial to finding better models. First, the host galaxy arcs finally 
provide enough constraints to separately determine both the lens galaxy shape and the cluster 
shear, and they must be incorporated into the modeling process. Second, models for the lens galaxy 



-20- 



must allow internal structure similar to that seen in the observed galaxy, which for Q 0957+561 
means a radially varying ellipticity and orientation. Although the mass need not exactly trace 
the light, it must be given the same freedoms lest we oversimplify the models. We introduced a 
new class of lens models, the double pseudo-Jaffe models, that smoothly incorporate an ellipticity 
gradient and isophote twist. 

Our models lead to several new conclusions about the system. First, the shape of the lens 
galaxy's mass distribution must be surprisingly similar to the shape of its luminosity distribution. 
The mass distribution must be moderately flattened and roughly aligned with the light distribution. 
Such alignment between the mass and the light is also seen in other gravitational lenses (Keeton et 
al. 1998), but was not seen in previous models of Q 0957+561. Second, the shear from the cluster 
must be small (7 ^ 0.1), in marked contrast with the moderate or strong shears seen in previous 
models (7 ~ 0.1-0.4). The small shear combined with an estimated convergence Kdus ~ 0.2 implies 
that the cluster potential must be approximately centered on the lens galaxy. Unfortunately, the 
current observations of the host galaxy cannot fully determine the lens model. There are still 
freedoms related to several direction angles that describe the cluster and to the amplitudes of the 
ellipticity and orientation gradients, and they leave a 25% uncertainty in the inferred value of the 
Hubble constant. 

However, there are two promising prospects for further improving the constraints on this 
systems. First, the substantial progress we have found is based on a 2800-second H band image. 
A deeper image with a refurbished NICMOS camera should show a complete Einstein ring image. 
Since an Einstein ring probes the potential all the way around the lens galaxy, it is extremely 
useful for determining both the lens model and the intrinsic shape of the host galaxy (see Keeton, 
Kochanek &; McLeod 2000). Filling in the gaps in the Q 0957+561 ring should eliminate the 
remaining uncertainties in the models. Second, X-rays from the cluster gas have been detected 
(Chartas et al. 1998), but the image resolution was poor and the signal was dominated by X- 
rays from the quasar images. New high-resolution X-ray observations to map the cluster gas would 
constrain the cluster potential and help determine the cluster angles that are still unknown (namely 
the angle to the cluster center and the angle of the cluster's density gradient at the lens). Combining 
deeper infrared and X-ray imaging would thus dramatically improve the constraints on the models 
and allow consistency checks of the cluster potential. Improved X-ray imaging would also further 
improve the mass estimates of the cluster for breaking the mass sheet degeneracy. 

Distorted images of the host galaxy have now been observed in four of the time delay lenses 
(Q 0957+561, PG 1115+080, B 1600+434, and B 1608+656; see Impey et al. 1998; Kochanek et al. 
1999). As we have illustrated, images of the host galaxy are a powerful constraint on models of the 
system and hence on the uncertainties in the value of Hq derived from the time delay measurements. 
To date, host galaxies have been thought of mainly as pleasant bonuses in relatively shallow images 
targeting the lens galaxies. Taking full advantage of the host galaxies will require deeper images 
focused on the host galaxies themselves. Such images hold great promise for breaking common 
degeneracies in lens models and allowing gravitational lensing to map in detail the mass distributions 
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of distant galaxies, to probe the potentials of lens galaxy environments, and to determine a robust 
and independent measurement of the Hubble constant at cosmological distances. 
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A. Appendix: Comparing lens models 

Table 3 summarizes results from models of Q 0957-1-561 by Grogin & Narayan (1996), Barkana 
et al. (1999), Bernstein & Fischer (1999), and Chae (1999). The various authors reported results 
in different ways. Sorting out the differences can be confusing, so we converted all the results to a 
standard form and explain the conversions here. 

First, Bernstein & Fischer (1999) and Chae (1999) quoted an ellipticity parameter e that is 
related to the axis ratio qhy q'^ = (1 — e)/(l-|-e). We report the true ellipticity e = l — q. (Bernstein 
& Fischer called their ellipticity parameter e, but it played the same role as e here.) 

Second, different authors quoted results for the Hubble constant in different ways. In general, 
lens models can determine only the combination h/{l — Kdus 

) where h = ifo/(100km s"^ Mpc"^) 
and Kcius = ^cius/^crit is the surface mass density of the cluster (in critical units) at the position 
of the lens. Models that use a multipole expansion for the cluster (an external shear or 3rd order 
cluster, see eq. ^) offer no constraints on Kdus- Models that use an actual mass distribution for 
the cluster do predict Kdus'- a-n isothermal sphere that has core radius s and is located a distance d 
from the lens produces Kdus = ^/(^O ^'^d shear ^dus = 

{h(P)/[2i{s + if], where ^ = Vs^ + d? and 
6 is a mass parameter such that M{R) = 'KTicrit b{Vs'^ + — s). However, the Hdus prediction 
is not unique because varying the cluster's shape and profile can change Kdus (and ^ydus) without 
changing the goodness of fit (see Chae 1999 for examples). Hence we describe these cluster models 
in terms of an equivalent external shear with magnitude jeS = 7d«s/(l ~ i^dus) (see Grogin &: 
Narayan 1996; Barkana et al. 1999). In Table 3 we quote the magnitude of the external shear (7, 
or 7eff in the isothermal sphere cluster models) and the Hubble constant combination h/{l — Kdus)- 
We note that if the cluster is a singular isothermal sphere, (1 — Kdus) = (1 + 7efr)~^ (see Barkana 
et al. 1999). The estimates of h/{l — Kdus) can be translated into actual estimates for Hq using 
the weak lensing measurement of the cluster mass (see §3.2). 

Third, different authors used slightly different definitions of x^- We did not convert values; 
we used what the authors reported. 

Finally, we corrected several apparent typographical errors. Barkana et al. (1999) claimed that 
their galaxy orientation angle 9 was a position angle, but in fact the position angle was 90° — 9 
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and this is what we quote. Bernstein & Fischer (1999) quoted angles measured from the positive 
X-axis. Their formahsm is self-consistent, except that their eq. (16) needs a minus sign in front of 
the 7 term. We report the Bernstein & Fischer (1999) angles as position angles. Finally, for the 
Bernstein & Fischer (1999) 3rd order cluster models there appears to be a difference of a factor of 
2 between the amplitude of 3rd order cluster term as written in their eq. (16) and as reported in 
their Table 3. We quote values consistent with their eq. (16). 



B. Appendix: The pseudo-Jaffe ellipsoid 

A standard Jaffe (1983) model has a 3-dimensional density distribution p oc (m + a)~^ 
where ra is an ellipsoidal coordinate and a is the break radius. For lensing it is more convenient to 
use a modified density distribution p oc {m? + s^)~^ {m? + a?)~^ where a is again the break radius, 
and we have added a core radius s < a. For this model the projected surface mass density, in units 
of the critical surface density for lensing, is 

S _ b 

where the mass normalization parameter b is chosen to match the lensing critical radius in the 
limit of a singular isothermal sphere (s — > 0, a ^ oo, q ^ 1). Eq. (|B1| ) defines what we call the 
pseudo-Jaffe ellipsoid. Its projected surface density is roughly constant for R ^ s, falls as R^^ for 
s <^ R ^ a, and falls as R^"^ for R ^ a; its total mass is M = 7:Tjcrit<lb{a — s). The ellipsoid 
coordinate m can be written in terms of the projected axis ratio q and the position angle PA 
(measured East of North) as 



2 2\"-'^/^ / 2 2\^-^/^ 

m + s ] — [m + a 



(Bl) 



i?2 



(l + q^) + (l - Q^) cos 2 {6- PA)] . (B2) 



2q^ 

The pseudo-Jaffe lens model is easy to compute because it is written as the difference of two 
softened isothermal ellipsoids, whose analytic lensing properties are known (Kassiola &: Kovner 
1993; Kormann, Schneider & Bartelmann 1994; Keeton & Kochanek 1998). This model has been 
used previously by de Zeeuw &; Pfenniger (1988), Brainerd, Blandford &: Small (1996), and Keeton 
& Kochanek (1998). 

In the limit of a singular model (s = 0), the pseudo-Jaffe model is an example of a general class 
of "cuspy" lens models with p oc -^{m^ + a2)(')'-'')/2. These models are a more realistic family 
for real galaxies than the softened power law models, and are discussed by Muiioz, Kochanek & 
Keeton (2000). 

The pseudo-Jaffe model is related to the King model used in FGS lens models. The standard 
approach is to approximate the King model with a combination of isothermal models (Young et al. 
1980), 

E 2.125 1.756 ^^^^ 



-'crit 
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The King model has a single scale radius r^, while the pseudo-JafFe model has independent core 
and break radii. Also, in the King model the coefficients of the two terms differ, while in the 
pseudo-Jaffe model they are the same. 
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TABLE 1 
Summary of Observations 



UT Date 


Camera/Filter 


Duration 


N/Dith 


Detect (mag/D") 


Source 


1998.05.30 


NIC2/F160W 


700 s 


4/3'.'0 


20.7 


GO-7887, Falco 


1995.11.19 


WFPC2/F814W 


160-900S 


2/0'.'2 


22.6 


GO-5979, Rhee 


1995.11.19 


WFPC2/F555W 


2300 s 


14/2'.'2 


25.2 


GO-5979, Rhee 



Note. — "Duration" refers to individual exposures. "N/Dith" indicates the number of expo- 
sures and largest dither separation. The detection limit corresponds to 3(T above the sky. 



TABLE 2 

ASTROMETRY AND PHOTOMETRY 



ID 


z 


RA 


Dec 


II 


V-I 


V-II 


I-II 


Re 


1 - hi a 


PA 






(") 


(") 


(mag) 


(mag) 


(mag) 


(mag) 


(") 




(°) 


A 


1.41 


=0 


=0 


1 5.60 ± 0.03 


0.38 ±0.13 


1.49 ± 0.09 


1.11±0.10 








B 


1.41 


1.229 ± 0.005 


-6.048 ±0.004 


15.68 ± 0.03 


0.34±0.11 


1.43 ± 0.07 


1.09 ± 0.09 








Gl 


0.36 


1.406 ± 0.006 


-5.027 ± 0.005 


15.14± 0.09 


1.95 ±0.07 


3.92 ± 0.07 


1.97 ± 0.04 


2.23 ± 0.20 


0.20 ±0.01 


44 ± 3 


G2 




8.ai6± 0.016 


-2.942 ±0.005 


17.92 ± 0.04 


1.90 ±0.35 


3.96 ± 0.25 


2.06 ± 0.24 


0.43 ± 0.02 


0.45 ±0.03 


62 ± 2 


G3 




5.04 ±0.01 


0.63 ±0.02 


21.5 ±0.2 


1.8 ±0.2 


3.3 ±0.2 


1.5 ±0.2 


0.10 ± 0.03 


0.17±0.04 


86± 36 


G4 




-0.551 ±0.009 


-5.498 ±0.009 


22.2 ±0.1 


1.6 ±0.3 


3.8 ±0.3 


1.7 ±0.1 


=0 







Note. — The pholomelric siriicl.nre (Re, 1 — hja. PA) of (lie lens galaxy Gl comes from a simple de Vaiicoiileiirs model, neglecling the 
radially varying ellipticity and major axis fonnd by Bernstein et al. (1997). The magnitude uncertainties (column 5) include the uncertainties 
in the photometric structure of the galaxy, while the color uncertainties (columns 6-8) are determined for a fixed photometric structure. 
Object G4 was unresolved, so we fit it with a point source. G3 and G4 correspond to faint objects labeled Blobs 7 and 1 (respectively) in the 
deep optical images of Bernstein et al. (1997). 



TABLE 3 
Historical Models of Q 0957+561 



Model(i) 






Cluster'^') 


Elliptirity 




T^adial 


Structure 










7 




Misc. 


e 


PA (°) 






Misc. 




"^arc / ( 




Grogin/SPLS 


0.298 


()7.-1 








1.08 




Core radius 0"058 


41.4/6 


5.71 


0.82 


Barkana/SPLS 


0.258 


()6.4 








1.16 




Singular 


253/11 


4.99 


0.78 


Bernstem/SPLS+C2 


0.214 


64.0 








1.16 




Singular 


87.9/7 


2.91 




Barkana/SPEMD 


0.364 


51.9 




0.531 


-56.2 


1.01 




Core radius 0"17 


89.1/9 


6.08 


0.73 


Chae/SPEMD+CL 


0.128 


51.8 


d = Q", s = 4'.'92 


0.170 


64.0 


1.28 




Singular 


49.6/25 


2.24 


0.74 


Bernstem/DM1+C2 


0.377 


60.0 




0.500 


-33.4 


1.19 




Singular 


36.4/6 


5.35 




Bernstein/DMl+CSS 


0.135 


49.0 


a- = 0.0078, e„ = 79?0 


0.358 


77.7 


1.17 




Singular 


6.1/4 


3.57 


0.92 


Bernstem/DM2+C2(*^) 


0.142 


57.0 




0.333 


-80.3 


1.26 




Singular, break 


7.2/2 


3.91 












0.170 


89.7 


1.34 




radius 1"06 








Grogiii/FGS 


0.377 


67.8 










2.54 


Scale radius 0'.'63 


19.0/5 


7.04 


0.94 


Barkana/FGS 


0.209 


65.0 










10.7 


Scale radius 2'.'58 


270/10 


10.83 


0.79 


Barkana/FGSE 


0.083 


63.4 




0.555 


67.3 




9.25 


Scale radius 1"59 


48.0/8 


2.70 


1.33 


Barkana/FGSE+CL 


0.122 


63.3 


d = 15'.'34, s = 


0.495 


61.9 




6.99 


Scale radius 1"39 


41.3/7 


2.24 


1.27 



Note. — We converted results quotrd by the various authors into a standard form, and we corrected several apparent t3'pogTaphical errors in the original 
papers. See Appendix A for details. 

(^'The various types of models are described in §§3.4 and 3.5. Galaxy models: SPLS = power law sphere; SPEMD = DM1 = power law ellipsoid; DM2 = 
broken power law model with one power law ellipsoid inside the break radius and an independent power law ellipsoid outside the break radius (the density 
is discontinuous across the break). Cluster models: no specification = C2 = external shear; C3S = 3rd order Taylor series; CL = isothermal sphere. The 
references are Grogin & Narayan (1996), Barkana et al. (1999), Bernstein & Eischer (1999), and Chae (1999). 

(^^7 and refer to the magnitude and position angle (North through East) of the external shear. Eor a 3rd order cluster (C3S), cr and 0„ give the 
magnitude and position angle of the 3rd order terms in the potential (see §3.5). For an isothermal sphere (CL), s is the core radius, d and 6^ are the distance 
and direction of the cluster from the lens galaxj'. and 7 is the amplitude of an equivalent external shear (see Appendix A). 

(^'Power law exponent in the sense S(i?) ~ R"^"^ or M(R) ~ R". 

(■^'Mass of central point mass in EGS models, in units of 10^° h^^ Mq. 

(^'Different studies used different definitions of x^, and we quote the results directly without trying to convert them to a standard form. 
(^'Our calculation of the contribution from the arc constraints, as defined in §3.3. 

(^'implications for Ho- In general the lens models determine only h/(l — K^im) where h = i?o/(100km s^^ Mpc^"') and «■<.;„., = S,,;,,., /S<,„7 is the surface 
mass density of the cluster (in critical units) at the position of the lens. If the cluster is a singular isothermal sphere, (1 — k^Ius) = (1 + 7) (see the 
.Appendix). These can be converted into Hq estimates using the weak lensing mass estimate (see §3.2). Bernstein & Fischer (1999) gave Hq results for the 
DM1+C3S model, but not for the other models quoted here. 

'^'The two sets of parameters refer to the inner and outer power law ellipsoids. 



TABLE 4 

Sample Double Pseudo-Jaffe Lens Models 



Component 


Normalization 


Core Kadius 


Cnt-ofT Radius 


Ellipticitv 


Orientation 


Cluster 


Cluster 






h/(l - k) 


Comment 




b (") 


(") 


(") 


e 


PA(») 


Amplitude 


Angle (°) 










inner 


12.06 


[0] 


0.117 


0.470 


58.37 


7 = 0.004 


0^ = 63.47 


flux- 


1.76 




Case la: 


outer 


5.52 


1.124 


[30] 


0.470 


58.37 


a- = 0.013 


0a = 46.07 


blobs: 


2.42 


1.041 


no twist, 


















nrr: 


1.15 




best xlrc 


inner 


13.83 


[0] 


0.101 


0.418 


61.26 


7 = 0.004 


0^ = 51.28 


nux: 


0.69 




Case lb: 


outer 


5.77 


1 .490 


[30] 


0.418 


61.26 


a = 0.014 


0a = 39.04 


blobs: 


6.17 


0.944 


no twist. 


















arc: 


1 .30 




lowest ffo 


inner 


3.89 


[0] 


3.596 


0.614 


58.30 


7 = 0.005 


0., = 69.01 


flux: 


5.82 




Case Ic: 


outer 


5.24 


1.783 


[30] 


0.614 


58.30 


a- = 0.011 


0a = 73.74 


blobs: 


6.55 


1.361 


no twist, 


















arc: 


1.28 




highest Ho 


inner 


2.51 


[0] 


0.532 


0.053 


64.59 


7 = 0.011 


0^ = 50.52 


flux: 


3.66 




Case 2a: 


outer 


5.62 


1.570 


[30] 


0.439 


61.06 


a- = 0.015 


0a = 39.14 


blobs: 


6.51 


0.971 


twist, 


















arc: 


1.11 




best xlrc 


inner 


1.84 


[0] 


1.651 


0.004 


49.02 


7 = 0.013 


0^ = 45.25 


flux: 


5.05 




Case 2b: 


outer 


7.27 


3.363 


[30] 


0.449 


61.30 


a- = 0.014 


0a = 39.39 


blobs: 


6.45 


0.852 


twist. 


















arc: 


1.50 




lowest Ho 


inner 


4.67 


[0] 


3.763 


0.691 


56.12 


7 = 0.022 


0^ = 40.49 


flux: 


5.33 




Case 2f: 


outer 


5.26 


1.931 


[30] 


0.573 


61.75 


a- = 0.015 


0a = 80.99 


blobs: 


7.07 


1.379 


twist. 


















arc: 


1.32 




highest Ho 



Note. — Parameters for sample models with radial ellipticity and orientation variations ("twist") or without ("no twist"). In all models, [• • •] denotes a scale 
length held fixed (see §3.5). 
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Fig. 1. — (a) The raw F160W (H band) image of the Q 0957+561 system, using a logarithmic 
grayscale. The lens galaxy (Gl) and quasar images (A and B) are labeled. The tickmarks on 
the frame are spaced by 1". (b) The residual image after subtracting the two quasars and the 
lens galaxy, plotted with a linear grayscale. The holes in the bright regions of the arcs are due to 
imperfect subtraction of the quasar images, (c) The residual image convolved with a '^0'.'076 (1 
pixel) FWHM Gaussian to enhance the visibility of low surface brightness features. The dotted 
contours are drawn at 2a above the sky. The rectangular mask around the A arc shows the region 
used to construct the unlensed source, while the polygonal mask around the B arc shows the region 
used to compute x^^^, (see §3.3). 



Fig. 2. — The observed (grayscale) and predicted (contours) arc structures for power law galaxy 
models. The grayscale is linear, and the solid contours are drawn at 2, 6, 10, . . . x cr above the sky. 
The dotted contours are drawn for the observed image at 2(7. The tickmarks on the frames are 

again spaced by l". We show the Grogin & Narayan (1996) SPLS model (top left), the Barkana et 
al. (1999) SPLS (top right) and SPEMD (bottom right) models, and the Chae (1999) SPEMD+CL 
model (bottom left). The galaxy is circular in the SPLS model and elliptical in the SPEMD models. 
The cluster is treated as an external shear in all models except for Chae/SPEMD+CL, in which it 
is treated as a softened isothermal sphere. 



Fig. 3. — Power law galaxy models from Bernstein &; Fischer (1999). The models are: SPLS+C2 
(top left), DM1+C2 (top right), DM1+C3S (bottom left), and DM2+C2 (bottom right). For the 
galaxy, SPLS indicates a circular power law, DM1 indicates an elliptical power law, and DM2 
indicates an elliptical broken power law. For the cluster, C2 refers to a 2nd order multipole 
expansion for the cluster (an external shear), and C3S refers to a 3rd order expansion. 



Fig. 4. — FGS-type models. We show the Grogin & Narayan (1996) FGS model (top left), as weU 
as the Barkana et al. (1999) FGS (top right), FGSE (bottom left), and FGSE+CL (bottom right) 
models. FGS refers to a circular galaxy and FGSE refers to an elliptical galaxy; both represent the 
cluster as an external shear. The FGSE+CL model represents the cluster as a singular isothermal 
sphere. 
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Fig. 5. — The intrinsic source structure inferred for two sample lens models. The grayscale and con- 
tours are the same as in Figures 2-4, and again the tickmarks on the frames are spaced by 1". The 
heavy curves show the lensing caustics. The cuspy diamond-shaped curve is the tangential caustic, 
and the round curve in the Barkana/SPEMD model is the radial caustic. In the Barkana/FGSE+CL 
model, the radial caustic is formally at infinity because of the point mass. A source inside the tan- 
gential caustic produces 4 images, between the tangential and radial caustics produces 2 images, 
and outside the radial caustic produces 1 image. The source for the Barkana/FGSE+CL model is 
offset because of the deflection produced by the isothermal cluster model. 
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Fig. 6. — Results for models without radial ellipticity and orientation variations. The points 
indicate the values for the parameters, Xarc ^'^d Hubble constant for individual models, while 
the histograms show the corresponding distributions (normalized to have the same peak value). 
We quote Hubble constant results in terms of the quantity /i/(l — Kc/us) that is constrained by 
the lens models; the Hubble constant histograms are the same in all panels. The vertical dashed 
lines indicate the parameter ranges we examined, except the range for is outside the figure 
(0° <9c< 90°). There were no explicit bounds placed on the cluster amplitudes 7 and a. 
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Fig. 7. — Similar to Figure 6 but for models with radial ellipticity and orientation variations. 



Fig. 8. — The observed and predicted arc structures for four of the new models given in Table 
4. Cases la and lb refer to the no-twist models that have the best fit to the host galaxy arcs 
and the lowest value of Hq, respectively. Cases 2a and 2b refer to the twist models with the same 
properties. 
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Fig. 9. — Estimates of the equatorial circular velocity profile for the models given in Table 4, 
assuming that the 3-d mass distributions are oblate spheroids viewed edge-on. The dotted and 
dashed curves show the contributions from the inner and outer pseudo-Jaffe components, respec- 
tively, and the solid curves show the total rotation curves. For reference, the quasar images A and 
B are 5'.'2 and I'.'O from the center of the lens galaxy. Note that the high inferred circular velocities 
{vc ~ 600 km s~^) are misleading, as explained in the text. 
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